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Abstract 


Let  T  = 


be  a  square,  nonsingular  matrix.  We  give  explicit  bounds,  accurate  to  within 


a  factor  of  at  most  3"^,  for  the  distance  from  T  to  the  nearest  singular  matrix  where  only  1,  2 
or  3  of  r's  subblocks  may  be  perturbed.  The  bounds  are  in  terms  of  the  norms  of  subblocks 
of  7""'.  These  results  hold  for  a  large  class  of  matrix  norms,  and  we  present  an  inexpensive 
algorithm  for  estimating  these  bounds  for  the  l-norm  and  infinity-norm  without  explicitly 
computing  7~'.  We  extend  some  of  these  results  to  block  3  by  3  matrices.  These  results 
have  application  in  W*  control  theory  and  stability  analysis  of  various  problems  in  linear 
algebra. 


1.  Introduction 
Let 


T  = 


A    B 
C   D 


(1.1) 


be  a  square  nonsingular  matrix.  We  are  interested  in  finding  the  nearest  singular  matrix  to  T 
where  only  some  of  the  subblocks  A,  B,  C  and  D  may  be  perturbed.  Our  answers  will  be 
expressed  in  terms  of  the  norms  of  the  blocks  of 


r-i  = 


w  X 

Y    Z 


(1.2) 


where  T~^  and  T  are  conformally  partitioned  (i.e.  A  and  W^  have  the  same  dimensions,  C 
and  X^  have  the  same  dimensions,  and  so  on).  In  particular,  we  are  interested  in  the  follow- 
ing three  problems: 

1)      What  is  the  smallest  ||8D||  such  that 

A        C 

B   D  +  hD 

is  singular?  The  answer  will  turn  out  to  be 


min  II8DII  = 


2) 


||-||  may  be  an  arbitrary  operator  norm  ||Z||  =  sup  ||Zx||/||x||.    Note  that  ||Z||  may  be 

zero;  in  this  case  no  &D  makes  the  matrix  singular.  By  permuting  rows  and  columns, 
we  see  that  the  solution  to  this  problem  includes  the  cases  where  any  other  single  block 
alone  (A  or  B  or  C)  is  to  be  perturbed,  as  well  as  the  case  of  the  smallest  |16D||  such  that 
[C  \D  +  bD]  is  singular  (let  A,  B,  W  and  Y  be  null  matrices).  When  A,  B  and  C  are  null 
so  that  T  =  D  and  7"~'=Z,  this  result  reduces  to  the  well-known  result  that  the  distance 
from  T  to  the  nearest  singular  matrix  is  ||r" 

What  is  the  smallest  ||diag(6A,6D)||  such  that 


•iii-i 


A+6A        C 
B        D  +  hD 


is  singular?  We  will  show 


—  s  min  ||diag(8A,6D)||  <  - — 


where 


K  =  max  [||Z||  ,  \\W\\  ,  (||X|M|y| 


\V2^ 


where  ||-||  may  belong  to  a  large  class  of  norms  including  all  p-norms.  When  T  is  real, 
we  prove  a  slightly  stronger  result  where  the  upper  and  lower  bounds  can  differ  by  a 
factor  of  at  most  2.  By  permuting  rows  or  columns,  we  see  this  solution  also  solves  the 
case  where  B  and  C  are  perturbed,  and  A  and  D  fixed.  We  also  consider  a  related  prob- 
lem arising  in  the  equality  constrained  quadratic  programming  problem,  where  both  T 
and  its  perturbation  must  be  symmetric. 

5A    6B 


3)       What  is  the  smallest 


0     &D 


such  that 

A+bA    B+hB 
C        D  +  bD 


is  singular?  We  will  show 


-^  s  min  ||diag(6A,6D)| 


ll/2 


where 


lYwy^] 


K  =  max  [1|Z||  ,  \\W\\  ,  \\Y\\  ,  (\\X\ 

where  ||-||  belongs  to  another  class  of  norms,  again  including  all  p-norms.  When  T  is 
real,  we  prove  a  slightly  stronger  result  where  the  upper  and  lower  bounds  differ  by  a 
factor  of  3.  Permuting  rows  and  columns,  we  see  this  solution  also  solves  the  cases 
where  any  other  subblock  (not  just  C)  of  T  is  fixed  and  the  others  vary. 

The  solutions  to  problems  2  and  3  extend  immediately  to  perturbations 


7-„  +  87-„ 

Tn 

r,3 

r„  +  Bri, 

ri2+87-,2  r,3 

T21 

722  +  6722 

T23 

and 

T2X 

722  +  8722     723 

7-31 

7-32 

7-33 

7-3, 

Til         733 

and  their  block  permutations.  The  problem  where  all  the  diagonal  blocks  7„  may  be  per- 
turbed is  harder,  and  our  upper  and  lower  bounds  may  sometimes  be  far  apart.  We  discuss 
these  extensions  in  section  6. 


Such  structured  perturbation  problems  are  of  interest  in  several  areas.  First,  they  arise 
in  the  design  of  control  systems  based  on  W"  optimization  [Doyle,  Fan  and  Tits].  Second, 
they  solve  certain  constrained  total  least  squares  problems  where  some  variables  are  fixed 
and  others  may  vary  [Demmel87b,  Golub,  Hoffman  and  Stewart].  Third,  they  provide  a 
measure  of  stability  for  the  equality  constrained  quadratic  programming  problem  [Gould]. 

Even  though  we  do  not  determine  the  smallest  perturbations  exactly,  we  do  exhibit  low 
rank  perturbations  which  satisfy  all  the  upper  bounds  given  above.  These  may  serve  as  start- 
ing points  for  optimization  routines  which  could  compute  the  smallest  perturbation  to  higher 
accuracy. 

If  T  is  large,  it  may  be  impractical  to  compute  T"'  explicitly  in  order  to  compute  the 
norms  of  its  sublocks.  We  present  a  method,  based  on  a  condition  number  estimator  of 
Hager  [Hager,  Higham],  for  estimating  the  1-norm  or  oo-norm  of  any  subblock  of  T"'  given 
the  ability  to  solve  Tx  =  b  and  T^y  =  c  efficiently.  Thus,  for  example,  if  T  is  large  and  sparse 
with  a  sparse  LU  factorization,  it  is  possible  to  compute  the  estimates  in  this  paper  cheaply. 

All  these  results  use  a  technique  originally  introduced  in  [Demmel87a]  for  estimating 
the  distance  to  the  nearest  "ill-posed"  numerical  analysis  problem,  such  as  the  distance  to  the 
nearest  singular  matrix  for  the  matrix  inversion  problem,  the  distance  to  the  nearest  defective 
matrix  for  the  matrix  eigenproblem,  the  distance  to  the  nearest  polynomial  with  multiple 
roots  for  the  polynomial  root  finding  problem,  the  distance  to  the  nearest  uncontrollable  sys- 
tem for  the  pole  placement  problem,  etc.  In  this  technique  we  associate  a  condition  number  k 
with  each  problem,  which  measures  the  sensitivity  of  the  solution.  Then  we  prove  one  or 
both  of  the  following  differential  inequalities: 

m-K^  s  ||Vk||  S  M-k2  (1.3) 

where  Vk  is  the  gradient  of  k,  and  0<msM  are  constants.  Inequalities  (1.3)  are  equivalent 
to  the  statement  that  the  condition  number  of  the  condition  number  is  (nearly)  itself.  In 
[Demmel87a]  it  was  shown  the  lower  bound  on  ||Vk||  in  (1.3)  yields  the  following  upper 
bound  on  the  distance  to  the  nearest  ill-posed  problem: 

dist  s  ^—  (1.4) 

m  -K 

and  the  upper  bound  on  1|Vk||  in  (1.3)  yields  the  following  upper  bound  on  the  distance: 

T^-  ^  dist   .  (1.5) 

For  example,  for  the  matrix  inversion  problem  we  choose  k=||7"~'||,  in  which  case  (1.3)  is 
satisfied  with  m  =M  =  1;  from  (1.4)  and  (1.5)  this  yields  the  well  known  result  that  the  dis- 
tance from  T  to  the  nearest  singular  matrix  is  ||r~'||~'.  We  solve  problems  1)  to  3)  above  by 
choosing  k  to  satisfy  one  or  both  of  the  inequalities  in  (1.3). 

For  more  complicated  block  perturbations  than  those  in  l)-3),  the  inequalities  in  (1.3) 
may  no  longer  provide  explicit  solutions,  but  they  do  provide  a  numerical  optimization  tech- 
nique: steepest  ascent  on  the  condition  number.  By  perturbing  T  in  an  allowed  direction  so  as 
to  make  the  norm  of  T"'  increase  as  quickly  as  possible,  we  can  find  good  upper  bounds  on 
the  distance  to  the  nearest  singular  matrix.  We  intend  to  pursue  this  approach  in  future  work. 

The  rest  of  this  paper  is  organized  as  follows.  Section  2  states  the  lemmas  from 
[Demmel87a]  which  justify  conclusions  (1.4)  and  (1.5)  above.  Sections  3,  4  and  5  solve  prob- 
lem 1),  2)  and  3),  respectively.  Section  6  discusses  the  block  3  by  3  matrix  case.  Section  7 
discusses  inexpensive  estimators  for  the  bounds  in  previous  sections.  Section  8  discusses 
future  work. 


2.  Basic  Differential  Inequalities 

In  this  section  we  state  the  basic  lemmas  we  will  need  for  our  lower  and  upper  bounds 
on  the  norm  of  the  smallest  rank  lowering  perturbation;  these  originally  appeared  in 
[Demmel87a].  Lemma  1  will  be  used  to  provide  lower  bounds  on  the  distance  to  the  nearest 
singular  matrix: 

Lemma  1:  Let  k(x)  be  a  positive  functional  on  a  normed  linear  space  X  with  norm  |1||.  Sup- 
pose 

U^sup    l'^(^  +  ^)-'^(^)l    :s  MkHx)  (2.1) 

for  all  x€-V,  where  M  is  a  positive  constant.  Then  for  all  *,  k(z)  is  finite  for  all  z  such  that 

1 


\2-X      < 


Mk(x) 


Proof:  Let  x(s)  be  a  straight  line  in  .V  starting  at  x{0)  =  x,  and  parameterized  by  arclength  s 
(i.e.  ||x'(j)||  =  1).    Consider  the  scalar  function  k(x(j)).  By  assumption 

,.               k(x(^  +  h))-  k(jc(5))    _   ,,      2/^    /■  \\ 
limsup      ^    ' " ^ — ^— "-  s  M  K  {x{s)) 

h-Q~  h 

SO  that  K(x(i))  is  bounded  above  by  the  function  y{,s)  where 

y'{s)  =  M-yHs)    ,     y(0)  =  K(x)    . 

It  is  easy  to  verify  that 

/-  ^               k(x) 
y(s)  =  ^^ 


1  -  Mk{x)s 

which  is  bounded  for  s  <  (A/k(x))"'.  Since  x(s)  was  an  arbitrary  straight  line  starting  at  x, 
this  proves  k(2)  is  bounded  in  a  ball  around  x  of  radius  at  least  (Mk(x))~'  as  desired.  D 

If  A' is  Euclidean  space  and  k  is  differentiable,  (2.1)  is  equivalent  to  ||Vk||  s  Mk^. 

Lemma  2  will  be  used  to  provide  upper  bounds  on  the  distance  to  the  nearest  singular 
matrix: 

Lemma  2:  Let  k(x)  be  a  positive  functional  on  a  normed  linear  space  A' with  norm  ||||.    Let 
x(j)  be  a  curve  in  .V  starting  at  x(0)  =  x  and  parameterized  by  arclength  s.    Suppose 

,.       .     r     K(x(s+h))—K(x(s))     ^  2/      /    \\  /■.,    «v 

bminf      ^    ' " ^    '■  "   &  m-K'^(x(s))  (2.2) 

h-0+  h  ^        ' 

for  all  s,  where  m  is  a  positive  constant.  Then  k(z)  is  infinite  for  some  z  satisfying 

Proof:  From  (2.2)  k(x(5))  is  bounded  below  by  y(j)  where 

y'(.)  =  my^s)    ,    y(0)=K(x)    . 

As  in  Lemma  1,  we  see 

r  ^  k(x) 

1  —  mK\x)s 

which  has  a  pole  at  j=(mK(x))~'.  Since  y{s)  bounds  k(x(j))  below,  k(x(j))  must  have  a 
pole  for  some  jq  ^  (""«(■*))'•  Since  s  is  arclength,  this  proves  the  lemma.  □ 

If  ,V  is  Euclidean  space,  k  is  continuously  differentiable,  and  ||-||  is  the  2-norm,  then 
assumption  (2.2)  may  be  replaced  by  ||Vk||  a  mv(} ,  because  this  implies  the  curve  x(j)  of 
steepest  ascent  of  k  satisfies  k'(x(j))  &  mK^{x{s)). 


3.  The  Smallest  Rank  Lowering  Perturbation  of  One  Subblock 

In  this  section  ||||  will  denote  any  matrix  operator  norm,  as  well  as  the  vector  norm 
which  induces  it;  ||-||d  ^ 
(1.2),  andletC=||Z||. 

Theorem  1:  The  smallest  IIBDII  such  that 


/i\\  denote  the  dual  vector  norm.    Let  T  and  T    '  be  as  in  (1.1)  and 


C 

D  +  8D 


(3.1) 


issingular  is  l/t- If  u^Zv=  4,  where  ||«'"||d  =  ||v||  =  1,  then  we  may  take  6D  =  -vu  /{.This 
theorem  applies  to  real  perturbations  of  real  matrices  as  well  as  complex  perturbations  of 
complex  matrices.  If  the  original  matrix  is  real  symmetric  (complex  Hermitian)  and  the  norm 
is  the  2-norm  1|||2,  the  minimizing  8D  can  be  chosen  real  symmetric  (complex  Hermitian). 

Proof:  If  hT  is  a  small  perturbation  of  T,  then  to  first  order 
(r  +  87")"'  =  r"'  -  r"'87T''.  Applying  this  to  (3.1)  shows  that  to  first  order  in  6D  the 
inverse  of  the  matrix  in  (3.1)  is  given  by 


XhDY  XhDZ 
ZhDY   ZhDZ 


(3.2) 


Consider  the  (2,2)  entry:  to  first  order  Z  is  perturbed  by  -Z8DZ.    This  implies  that  to  first 


order  the  perturbation  8^  in  the  norm  {  of  Z  cannot  be  more  than  ||Z8DZ| 
by  Lemma  1,  the  smallest  ||8D||  such  that  {  is  infinite  satisfies 


mm 


I8DI 


(2||8D||.Thus, 


(3.3) 


By  (3.2),  no  other  subblock  of  7""'  can  become  unbounded  unless  Z  becomes  unbounded,  so 
(3.3)  is  also  a  lower  bound  on  the  smallest  ||8D||  such  that  the  matrix  in  (3.1)  is  singular.  To 
see  that  this  lower  bound  is  attained  for  some  8D,  simply  choose  8D  proportional  to  the 
rank-one  matrix  -vu^,  where  v  and  u^  are  unit  vectors  satisfying  ||Z||  =  u^Zv  (i.e. 
||„7"||^  =  ||v||  =  1).  This  rank-one  choice  of  hD  perturbs  Z  by  a  rank-one  matrix  proportional 
to  Zvu^Z,  which  is  easily  seen  to  attain  the  lower  bound  in  (3.3).  Therefore,  by  Lemma  2,  we 
have  min||8D||  ^  £"'.  That  6D=-vu'^/4  makes  T  singular  may  also  be  verified  via  the 
Sherman-Morrison  formula  [Golub  and  Van  Loan,  p.  3]. 

If  T  is  real  symmetric  (complex  Hermitian)  and  the  norm  is  the  2-norm,  then  Z  is  real 
symmetric  (complex  Hermitian)  and  u  and  v  may  clearly  be  chosen  equal  (complex  conju- 
gate), so  that  hD  is  real  symmetric  (complex  Hermitian).  n 

Since  the  smallest  perturbation  is  rank-one,  we  see  that  the  smallest  distance  in  the 
matrix  2-norm  is  also  the  smallest  distance  in  the  Frobenius  norm;  these  results  were  obtained 
by  other  methods  in  [Demmel87b],  where  rectangular  matrices  of  arbitrary  rank  were  treated 
as  well.    It  was  also  solved  in  [Watson]. 

Note  that  when  4=0,  no  perturbation  hD  of  D  will  make  T  singular.  This  implies  that 
the  distance  1/^  may  be  a  very  ill-conditioned  function  of  the  entries  of  T.  For  example,  if 


T  = 


£     1 

1   0 


then  the  smallest  perturbation  of  the  2,2  entry  which  makes  T  singular  is  e~'.  As  c  goes  to 
zero,  the  norm  of  this  perturbation  goes  to  infinity,  and  when  €  =  0,  no  perturbation  lowers 
the  rank.   In  the  case  of  the  2-norm,  a  sensitivity  analysis  of  1/C  was  done  in  [Demmel87b]. 

When  A   is  square  and  nonsingular,  a  more  elementary  construction  yields  the  same 
result: 


A         B 

I 

0 

C   D  +  bD 

CA-' 

/ 

A  B 

0   D-CA~'B  +  8D 

is  singular  if  and  only  if  Z>  -CA  ~'B  +  6D  is  singular,  and  the  smallest  perturbation  ||8D||  that 
makes  this  block  singular  is  given  by  the  well  known  expression  ||(D  —CA  ~'fl)~'||~'.    But 

A"'+A~'fi(D-CA"'B)"'CA"'    -A~^B  {D -CA'^B)'^ 
-(D-CA'^By^CA-^  {D-CA-^B)-^ 

as  desired. 


A    B 

W   X 

C   D 

Y    Z 

so  \\Z\ 

=  IK 

D-BA 

->c)-' 

4.  The  Smallest  Rank  Lowering  Perturbation  of  Two  Sublocks 

In  this  section  we  are  interested  in  the  smallest  ||6r||  such  that  r  +  6r  is  singular  where 

\hA     0 
0     8D 

Here  ||-||  may  be  any  operator  norm  subject  to  the  restriction 

5A     0 


max(||8A||,  ||8Di|)  = 


0     8D 


this  is  satisfied,  for  example,  by  all  the  matrix  p-norms.  As  before,  ||||  will  also  denote  the 
vector  norm  which  induces  the  matrix  norm,  and  ||||o  the  dual  norm,  and 

W  X 

Let(o=||W||,4=||X||,Ti=||y||,andC=||Z||. 

Theorem  2:  Let  k  =  max(C,aj,(4-n)"').    Then  the  smallest  ||diag(6A,8B)||  such  that 

A  +8A         B 
C        D  +  bD 


is  singular  is  bounded  by 
1     _ 


1 


2k         max(cu,C)  +  (4t)) 


1/2 


min  ||diag(8A,8B)|| 


where  -y  £  3"'^.    More  specifically,  if  we  define  a  =  max((i),C)  /  (^■n)^'^  then 

1  ifo&l 

_1_ 
a 


q+a^) 


2\\I2 


l-a2 


if  3-"2  <  a  <  1 
ifa  s  3-"2 


When  T  and  87  are  real,  this  result  can  be  improved  to 

; ;^ ; —  s  min  ||diag(6A,6fl)||  ^ r-- 

max(a,,C)  +  (4^))"^  ^  ^^        max(«o,C,aTi)"2) 

note  that  the  upper  bound  is  at  most  twice  the  lower  bound. 

Proof:  To  first  order. 


(4.1) 


(r  +  B7)"'  =  r~'  -  T~'87T"'  = 


W   X 
Y    Z 


WhAW  +  XhDY   W&AX  +  XBDZ 
YBAW  +  ZbDY     Y&AX  +  ZhDZ 


Thus,  to  first  order  the  perturbations  in  w,  i,  i)  and  i  are  bounded  by  (w^  +  4T))||6r||, 
6(o)  +  C)||67-||,  •n(to  +  C)||S7-||,  and  (C^  +  e^)||6r||.  Let  kq  =  max(w,i)  +  (^11)"^  Therefore,  to 
first  order  the  perturbation  to  kq  is  bounded  by 

8ko  £  (maxa^  +  ^Tl  ,  to^  +  ^Ti)  +  (a,  +  i)aTi)"^)||8r||  s  k§  ||8r||    , 

as  is  easily  verified  by  computation.  Note  that  ||r"'||  cannot  become  infinite  unless  kq 
becomes  infinite.  Applying  Lemma  1  as  in  the  proof  of  Theorem  1,  we  see  that  the  smallest 
llSril  =  ||diag(8A,6D)||  such  that  7"  + 8r  is  singular  is  bounded  by 

1 


|8ri 


V,4       0 

and    U'^  = 

«I     0 

0    vo 

.0    ul 

proving  the  lower  bound. 

To  prove  the  upper  bound  we  will  exhibit  perturbations  diag(8A,8D)  which  attain  it. 
There  are  3  cases  as  indicated  in  (4.1).  In  case  1,  when  a  =  max((i),C)  /  (f-n)"^  a  1,  we 
know  from  Theorem  1  that  there  is  a  perturbation  of  A  alone  (if  o)2:{)  or  of  D  alone  (if 
(2;u))  of  norm  at  most  1/k  =  l/max(to,0  which  makes  7"+ 87  singular.  In  case  2,  when 
l>a>3~"^,  the  same  perturbation  of  A   alone  or  D   alone  has  norm   l/max(a),()  =  . 

From  Theorem  1,  f>A  and  8D  may  be  chosen  real  symmetric  (complex  Hermitian)  if  T  is  real 
symmetric  (complex  Hermitian)  and  the  norm  is  the  2-norm. 

In  case  3,  when  a  s  3""-,  we  must  have  both  6A  and  8D  nonzero.  We  will  exhibit  rank 
one  matrices  8A  =  v^wj  and  8D  =  vduJ  of  norms  at  most  (l  +  a^)"^/(l-a^)  which  make  T  +  hT 
singular.  Let 


V  = 


be  n  by  2  and  2  by  n  matrices  respectively;  note  that  87=  diag(8A,6D)  =  Vt/'".  By  the 
Sherman-Morrison-Woodbury  formula  [Golub  and  Van  Loan,  p.  3]  T  +  hT  is  singular  if  and 
only  if  / +  1/^7"' V  is  singular,  i.e. 

(1+uIWv^){1+uIZvd)  -  (uIXvd)-{uIYva)  =  0    .  (4.2) 

We  choose  v^,  uj,  v^,  and  ul,  as  follows.  First  find  unit  vectors  v^  and  u^  such  that 
u\Xv^  =  i,  and  unit  vectors  v,,  and  mJ|  such  that  uli^Yvy^  =  r).    Now  let 

ul  =  ul  /  (et,)"^ 

4  = "?;  /  ii^y  ' 

note  that  1 1 87 1 1  =   |p|/(eTi)"'  and 

u.\Wv^  =  p-w     where     |w|  s  a 


udZvd  "  p-i.    where     |t|  ^  a 
(uJXvD)-(«5yv^)  =  p2 


so  we  may  rewrite  (4.2)  as 


(l+p-a))-(l  +  PiD  -  P 


2  _ 


whose  solutions  are 


p±  = 


^    C  +  co  ±  ((i^-lY  +  4) 


\V2 


2(1  -  0)1) 


It  is  easy  to  verify  that 


min  |p*  I  s 


(1  +  °^) 
l-a^ 


2\l/2 


as  desired.  It  is  also  easy  to  verify  that  if  0<as3""^,  (l+a2)"^/(l-a^)  £  a~',  and  if 
3~"^sa<l,  (l  +  a^)"^/(l  — a^)  &  a~'.  When  T  is  real,  ^  and  <o  are  real  and  one  may  verify 
that  min  jp^t  I  ^  1  for  all  1^  and  u  less  than  1  in  magnitude.   This  completes  the  proof.  □ 

The  factor  -y  in  the  upper  bound  could  be  improved  slightly  by  a  more  careful  analysis 
of  min  \p±\;  even  then,  for  a  given  T,  the  upper  bound  constructed  in  the  theorem  may  be 
smaller  than  the  upper  bound  ^/k.  Note  that  the  lower  bound  for  the  2-norm  is  necessarily  a 
lower  bound  for  the  Frobenius  norm.  Also,  2"^  times  the  upper  bound  is  an  upper  bound  for 
the  Frobenius  norm  because  the  perturbations  attaining  it  have  rank  at  most  2. 

When  T  is  real  symmetric  (complex  Hermitian)  and  the  norm  is  the  2-norm,  then  of 
course  a  solution  is  B7"  =  -X^^^I,  ||B7"||  =  |Xniin  |.  where  X.n,in  is  the  eigenvalue  of  T  of  smal- 
lest absolute  value.  However,  the  Frobenius  norm  of  this  57"  is  n"^  times  larger.  We  may 
construct  a  rank  lowering  symmetric  perturbation  whose  Frobenius  norm  is  within  2"^  of  the 
upper  bound  y/K  for  the  2-norm  as  follows:  Since  X  =  Y^  (X  =  Y*)  we  may  take  V{=m^ 
(vj=i7.^)  and  tt{  =  VT,  («{  =  v,,)  in  the  proof.  The  scalar  p  will  be  real,  and  both  hA  and  6D 
will  be  rank  one  real  symmetric  (complex  Hermitian),  so  the  Frobenius  norm 
||diag(8>l,6D)||r  s  2'''||diag(8/l,8D)||2. 

A  related  problem  to  the  one  solved  in  Theorem  1  is  to  find  the  smallest  ||6A||  such  that 
the  symmetric  matrix 


H 
A  +  bA 


A'^+hA'^ 
M 


(4.3) 


is  singular.  This  problem  arises  in  analyzing  the  stability  of  the  equality  constrained  quadratic 
programming  problem  [Gould] 

min     —x^Hx+c^x    ; 

x.Ax^b    2 

we  assume  all  matrices  are  real  and  use  the  2-norm  ||-||2.    The  Kuhn-Tucker  first  order  condi- 
tions for  this  problem  are 


(4.4) 


H   A^ 
A     0 

X 

-V- 

= 

—  c 
b 

(ji  is  the  vector  of  Lagrange  multipliers). 


The  smallest  ||8A||2  which  makes  this  linear  system  singular  is  a  measure  of  its  stability,  just 
as  the  usual  condition  number  ||A||||A  ~'||  for  Ax  =  b  is  inversely  proportional  to  the  distance 
||i4~'||~'  from  A  to  the  nearest  singular  matrix  [Kahan,  Demmel87a]. 

The  problem  in  (4.3)  is  more  difficult  than  the  one  solved  in  Theorem  2.   For  example, 
if  both  H  and  M  -AH~^A^  are  invertible,  then 


r-i  = 


A^ 
M 


«->-hh-U'"(m-aw-'a'')-'a«-'   -//-'/i'"(m-aw-'a'")~' 


-{M-AH-^A^y^AH 


(M-AH 


■UO- 


Thus,  if  ff  is  positive  definite  and  M  negative  definite  (or  vice  versa),  J"'  exists  for  all  A. 
Therefore  a  necessary  (and  clearly  sufficient)  condition  for  the  existence  of  a  rank  lowering 
perturbation  hA  is  that  both  H  and  M  have  nonnegative  (or  nonpositive)  eigenvalues;  this 
condition  cannot  be  captured  in  terms  of  norms  of  the  subblocks  of  T~K  This  necessarily 


limits  the  applicability  of  the  following  theorem: 
Theorem  3:  Let 


H   A^ 

,  r->  = 

w  x^ 

and   87  = 

0    ba'" 

A    M 

X     Y 

[hA      0 

where  all  matrices  are  real  and  T  and  57  are  symmetric.    Let  (i)  =  ||W||2,  t=||X|l2,  •n  =  ||l'||2. 
The  smallest  ||6r||2  =  ||8A||2  such  that  7" +  67  is  singular  satisfies 

If  in  addition  both  W  and  Y  are  positive  semidefinite  (or  negative  semidefinite),  then 

this  upper  bound  is  within  a  factor  of  2  of  the  lower  bound. 
Note:  The  upper  bound  does  not  apply  to  the  matrix  in  (4.4). 
Proof:  To  first  order, 


(7  +  87) 


■1   _ 


7"'  -7    '677 


■1  _ 


W  x'^ 

X     Y 


WhA'^X  +  X^hAW    W8A'"y  +  X'"6AX^ 
XhA'^X+YhAW      XhA'^Y+YhAX'^ 


Thus,  to  first  order  the  perturbations  in  w,  ^  and  ii  are  bounded  by  2a)41|6A||2, 
(g2  +  wT|)||6A|l2  and  2t)4||8A|12.  Let  k  =  ^+(to-n)"^.  To  first  order  the  perturbation  to  k  is 
bounded  by 

6k  <   (e=  +  coT,  +  24(a,T,)"2)||8A||2   =    k2 


I8AI 


Note  that  ||7    '||  cannot  become  infinite  unless  k  becomes  infinite.    By  Lemma  1,  the  smallest 
||6A||2  such  that  7  +  87  is  singular  is  satisfies 

||8A||2  ^  ^ 


proving  the  lower  bound. 

To  prove  the  upper  bound  we  will  consider  2  cases,  when  a  =  ^l{<ji-t\y^  s  1,  and  when 
a  >  1.  We  assume  without  loss  of  generality  that  both  IV  and  Y  are  positive  semidefinite 
(otherwise,  negate  7).  Consider  first  the  case  a  s  1.  We  will  exhibit  a  6A  whose  norm 
satisfies  the  upper  bound.  Let  k^  and  Vt|  be  unit  vectors  such  that  ul^Wu^^m  and  v\Yvy^  =  'e\. 
Let  u  =  ttu/w"'  and  v  =  pv.^/-n"^.  Let  8A  =  vu^  and 

_       Fn    ,.rl 

and     U^  = 


V  = 


u   0 
0    V 


0     V 
u^    0 


then  87  =  VU^  and  again  by  the  Sherman-Morrison-Woodbury  formula  7  +  67  is  singular  if 
and  only  if  /  +  U^T~W  is  singular,  or 

(1+  v'^Xu)^  -  {u^Wu)iv^Yv)  =  0    .  (4.5) 

With  our  choice  of  u  and  v  we  may  write  v^Xu  =  pp  where  |p  |  s  a,  as  well  as  u^Wu=l  and 
v^l'v  =  p^,  so  that  (4.5)  becomes 

(l  +  p3)2-p2  =  0 

or  p±  =  -(p±l)"'  (if  |P|  =  1  there  is  just  one  solution  |p|  =  .5).  Clearly  min  [p*  |  £  1,  so 
||8A||2  =  ||"|!2"||»'||2  ^  (w'n)"^'^.  yielding  the  upper  bound  in  the  case  a^l. 

When  a  >  1,  we  need  to  choose  u  and  v  differently.  Let  u^  and  v^  be  unit  vectors  such 
that  v[Xuj=4,  and  let  u=pu^/i  and  v=vj.  Note  ||6A||2=  |p|/4-  With  this  choice  of  u  and  v 


we  may  write  {u^Wu)(v''^Yv)=pp^ ,  where  Ospsa   ^<1.  Thus  (4.5)  becomes 

(l  +  p)2-pp2  =  0 

which   has   solutions   p±  =  -(1±3"^)"*.     Since   min|p- |  s  1,    ||8A||2  s  |~',   yielding   the 
upper  bound.  □ 

Note  that  it  is  always  possible  to  find  a  complex  perturbation  6A  satisfying  the  upper 
bound,  independent  of  the  signs  of  the  eigenvalues  of  H  and  M;  this  follows  directly  from  the 
proof.  However,  if  we  change  the  problem  to  finding  the  smallest  complex  perturbation  6A 
such  that  the  Hermitian  matrix 

'h  a* 

A    M 

is  singular,  we  get  the  same  necessary  and  sufficient  conditions  on  the  eigenvalues  of  H  and 
M  for  the  existence  of  6A  as  when  everything  is  real. 

As  in  Theorem  2,  the  lower  bound  for  the  2-norm  is  a  lower  bound  for  the  Frobenius 
norm,  and  the  upper  bound  for  the  2-norm  is  also  an  upper  bound  for  the  Frobenius  norm 
because  the  perturbation  attaining  the  upper  bound  has  rank  1. 

When  H  is  positive  (or  negative)  definite  in  (4.4),  it  is  easy  to  see  that  the  matrix  there 
is  invertible  if  and  only  if  A  has  full  rank,  and  therefore  the  smallest  rank  lowering  perturba- 
tion of  the  matrix  is  simply  the  smallest  rank  lowering  perturbation  of  A,  and  has  norm 
anii„(i4).  When  A  is  indefinite  o^^^{A)  is  still  an  upper  bound  on  the  norm  of  the  smallest 
rank  lowering  perturbation,  although  it  is  no  longer  a  lower  bound  as  well. 

In  the  next  section  we  will  extend  Theorem  3  to  permit  perturbations  in  H  as  well  as  A. 


5.  The  Smallest  Rank  Lowering  Perturbation  of  Three  Subblocks 

In  this  section  we  are  interested  in  the  smallest  \\hT\\  such  that  T -I- 87  is  singular  where 

\hA    IB 
0     hD 

Here  ||-||  may  be  any  operator  norm  subject  to  the  restrictions 


max  (||£||  ,  ||f  II)  = 


E    0 
0    F 


0   G 
0    0 


=  IICI 


and  that  the  norm  of  a  matrix  is  at  least  as  large  as  the  norm  of  any  subblock.   These  restric- 
tions are  satisfied,  for  example,  by  all  the  matrix  p-norms.  As  before,  let 


7-1  = 


W  X 
Y    Z 


andu,=  ||W||,4=||X||,-n=||y||,  andC  =  ||Z||. 

Theorem  4:  Let  k  =  max(i;,(o,-n,(gTi)"2).   xhen  the  smallest  ||8r||  = 


bA    bB 
0     hD 


such  that 


A+bA    B  +  hB 
C        D  +  bD 


is  singular  is  bounded  by 
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-^  s  min  ||8r||  ^  ^    , 

3k  "       "  K 

where  -y  s  3"'.    More  specifically,  if  we  define  a  =  max(u),C,-n)  /  (^t))"^  then 

1  ifasl 

J_  if  3-1/2 

a 


if  3""^  <  a  <  1 
if  a  5  3-"^ 


(5.1) 


1-a^ 


If  T  and  hT  are  real,  this  result  can  be  improved  to 


1 
3k 


min  ||6r||  ^  — 

K 


Proof:  To  first  order, 


(r  +  8r) 


■1  _ 


•1  _ 


'57"7"" 


Y    Z 


WhA-W  +  WhBY  +  XhDY   WhAX  +  WhBZ  +  XhDZ 
YhAW+YhBY  +  ZhDY      YhAX+ YhBZ->rZhDZ 


Thus,  to  first  order  the  perturbations  in  w,  ^,  "n  and  4  are  bounded  by  (a)^  +  o)T)  +  £T))||8r||, 
(a)t  +  (o4+401|B7"|j,  (Tiw  +  Ti2  +  CT))|18ri|,  and  (ti4+ tiC  + (^)i|87'||.  Therefore,  to  first  order  the 
perturbation  to  k  is  bounded  by 


isri 


CJ     +  a)T|  +  ^T) 

tiw  +  ti^  +  Cti 

.5(eTi)''2.(2u,  +  2C+Ti)  +  .5(t,/0"'u,C 


if  (i)=  K 
if  {=K 
if  1)  =  K 

if(4Tl)"'2=K 


as  is  easily  verified  by  computation.  Note  that  IJ?"  '||  cannot  become  infinite  unless  k 
becomes  infinite.  The  only  term  in  6k  that  prevents  us  from  immediately  concluding  that 
6k  s  3k^  ||8r||  is  .5(-n/4)"2wS  if  (4-n)"2=K.  But  (€-n)"'=K  implies  (eT))"^2:-n  or  i^r^;  thus 
(•i)/4)"-<l  so  5k  <  3k'  ||87"||.  From  Lemma  1  we  conclude  ||6r||  2:  1/(3k)  if  7"  +  87  is  singu- 
lar, proving  the  lower  bound. 

To  prove  the  upper  bound  we  will  exhibit  perturbations  hT  which  attain  it.  The  three 
cases  indicated  in  (5.1)  are  essentially  the  same  as  in  (4.1)  of  Theorem  2:  In  case  1,  when 
a  =  max(a),4,T|)  /  (^fl)'^  ^  1.  we  know  from  Theorem  1  that  there  is  a  perturbation  of  A 
alone  (if  to  is  largest),  B  alone  (if  t)  is  largest)  or  D  alone  (if  t  is  largest)  of  norm 
1/k  =  l/m2ix(iD,5,T))  which  makes  T  +  hT  singular.    In  case  2,  when  l>a>3~"^,  the  same 

perturbation  of  A,  B  or  D  alone  has  norm  l/max(<D,{,Ti)  = 


aK 


In  case  3,  when  a  s  3  "^,  we  perturb  8A  and  6D  just  as  in  the  proof  of  Theorem  1, 
leaving  8fi  =  0.  This  completes  the  proof.  D 

It  is  interesting  that  it  is  never  necessary  to  perturb  all  three  entries  A,  B  and  D  simul- 
taneously to  prove  the  upper  bound  in  the  theorem.  Note  also  that  the  lower  bound  for  the 
2-nonn  ||||2  is  a  lower  bound  for  the  Frobenius  norm  ||-||f,  and  that  2"^  times  the  upper 
bound  for  the  2-norm  is  an  upper  bound  for  the  Frobenius  norm,  since  the  perturbations 
attaining  the  upper  bound  have  rank  at  most  2. 

Now  we  return  to  the  problem  considered  in  the  last  section:  Let 


r  = 


H  A' 
A     M 


7-1  = 


W  X' 
X     Y 


and   6r  = 


bH   6A'" 
&A      0 


(5.2) 


where  all  matrices  are  real  and  T  and  hT  are  symmetric.  We  seek  min||87"||2  where  7-i-8r  is 
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singular. 

Theorem  5:  Let  T,  7""'  and  67"  be  real  matrices  as  in  (5.2).    Let  w=|IW||2,  ^=||X||2,  'n  =  ||y||2, 
and  K  =  max(ti),|,(wT))"^).   The  smallest  ||6r||2  that  makes  r  + ST  singular  satisfies 

-^  s  min  ||6r||2    . 

If  in  addition  either  k  =  (d  or  both  W  and  Y  are  positive  semidefinite  (or  negative  semidefin- 
ite),  then 

min  ||8r||2  ^  —    . 


Proof:  To  first  order 

(r  +  sr)"'  =  r"'-7"~'87T"' 


W  X^ 
X     Y 


WhHW  +  WbA^X  +  X^hAW   W8//x''+W6A'"y  +  x'"6AX^ 
XbHW  +  XbA^X+YhAW       XbHX'^  +  XhA^Y+YhAX'^ 


Thus,  to  first  order  the  perturbations  in  w,  4  and  t)  are  bounded  by  (<D^  +  2<D4)||8r||2, 
(^^  +  o)^  +  (i)Ti)||5r||2  and  (4^  +  24t))||87"||2.  Thus,  to  first  order  the  perturbation  in  k  is 
bounded  by 


8k  s  II87-I 


0)^  +  20)4 

24(0,  ■n)''2+. 50,(0)  ■n)"2+.542(o,/.r,)"2 


if  O,  =  K 

if  4=K 

if   (o,T,)"2=K 


Note  that  ||r~'||  can  only  become  infinite  if  k  becomes  infinite.  The  only  term  in  8k  that 
seems  to  prevent  us  from  concluding  that  6k  s  3k^||87"|1  is  .54^(a,/'n)"^  when  k  =  (ojt))"^. 
But  K  =  (o)Ti)"^  means  (o,Ti)"^aa,  or  r\^iii,  whence  w/tisI  and  6k  s  3k^||87'||2.  From 
Lemma  1  we  conclude  min||87"||2  2:  1/(3k)  as  desired. 

For  the  upper  bound,  there  are  two  cases,  depending  on  whether  k  =  g,,  or  both  W  and  Y 
are  positive  semidefinite  (or  negative  semidefinite).  In  the  first  case  Theorem  1  guarantees 
the  existence  of  a  symmetric  perturbation  of  H  alone  with  8A=0  and  ||6//||2  =  l/co  =  1/k 
such  that  r  +  87"  is  singular.  In  the  second  case  Theorem  3  guarantees  the  existence  of  a  per- 
turbation of  A  alone  with  6//  =  0  satisfying  the  upper  bound.  D 

It  is  interesting  that  it  is  never  necessary  to  perturb  A  and  H  simultaneously  to  prove  the 
upper  bound  in  the  theorem.  As  with  the  Theorem  3,  the  upper  and  lower  bounds  for  the  2- 
Dorm  are  also  bounds  for  the  Frobenius  norm. 


6.  Extensions  to  3  by  3  Block  Matrices 

In  this  section  we  discuss  various  smallest  rank  lowering  structured  perturbations  87  of 
T  where  T  and  67"  are  3  by  3  block  matrices.  Let 


T  = 


Tu 

Tn 

^13 

T2i 

7-22 

3-23 

Tii 

7-32 

7'33 

and    T 


-1  _ 


jU 

7-12 

7-13 

jlX 

7-22 

7-23 

j-31 

7-32 

7-33 

(6.1) 


where  T  and  T   ^  are  conformally  partitioned  (i.e.  Ty  and  T''  have  the  same  dimensions). 
Our  upper  and  lower  bounds  will  be  in  terms  of  the  t^  =  ||r'^||.  These  results  include  the 
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cases  where  the  last  block  row  and/or  column  of  T  is  null. 

Theorems  2  through  5  extend  to  this  case  with  essentially  no  changes  in  their  proofs.  In 
Theorems  6  to  9  it  is  possible  for  no  rank  lowering  perturbation  6r  to  exist.  In  Theorems  6 
and  8  the  lower  and  upper  bounds  will  both  be  infinite  in  this  situation.  If  the  lower  bound  in 
Theorems  7  and  9  is  infinite  this  also  implies  no  rank  lowering  perturbation  exists.  Unfor- 
tunately, the  extension  of  this  approach  to  perturbations  of  all  diagonal  blocks  7",,  is  not  as 
complete,  and  it  is  possible  for  a  large  gap  to  exist  between  the  upper  and  lower  bounds. 

Theorem  6:  Let  T  and  T"'  be  as  in  (6.1),  and  t,;  =  ||r'^||.  Let 


6r  = 


be  conformally  partitioned  with  T.  Let  the  norm  ||-||  satisfy  ||diag(X,y)||  =  max  (||X|l,||y||). 
Let  K  =  max(Ti,,T22,(Ti:T2,)"-).    Then  the  smallest  ||S7"||  such  that  7"  +  ST"  is  singular  satisfies 


87,, 

0 

0 

0 

87-22 

0 

0 

0 

0 

-!-  <  min  ||diag(8A,8B)| 
2k 


K 


where  -y  £  3"'.    More  specifically,  if  we  define  a  =  max(Tii,T22)  /  (•'■i2'''2i)   '  then 

if  aal 

if  3""2  <  a  <  1 


1 

J_ 

a 

(1  +  a^)"^ 

I      1-a' 

if  a  <  3 


-1/2 


If  T  is  real,  this  result  may  be  improved  to 
1 


max(Tii,T22)    +    (Ti2'''2i)'  " 


min  ||diag(8A,8B] 


1 


max(Tii,T22,(Tl2T2l)    ') 


note  that  the  upper  bound  is  at  most  twice  the  lower  bound.  If  T  is  real  symmetric  (complex 
Hermitian)  and  the  norm  is  the  2-norm  ||-||2.  then  there  are  8,4  and  8D  satisfying  the  upper 
bound  of  the  theorem  which  are  also  real  symmetric  (complex  Hermitian). 

Sketch  of  Proof:  The  proof  is  essentially  identical  to  that  of  Theorem  2.  The  only  difference 
is  that  the  rank  2  matrices 


V  = 


whose  product  VU^  equaled  67"  in  the  proof  of  Theorem  2  must  be  enlarged  to 


va     0 

and    U'^  = 

"I     0 

[o     vd\ 

[o    u'd\ 

va 

0 

0 

VD 

0 

0 

and   I/'"  = 


"loo 

0    ul   0 


Theorem  7:  Let  T  and  T       in  (6.1)  be  real  and  symmetric,  and  let  the  partitioning  be  sym- 


metric as  well  {T,j  =  Jj,).    Let  t,_, 


•j'- 


Let 


&T  = 


0 

6rJ,  0 

Til 

0      0 

0 

0      0 

be  conformally  partitioned  with  T.   The  smallest  ||8r||2  such  that  r  +  6r  is  singular  satisfies 
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1 


—  £  min||6r||2    . 


T21  +  (tiiT22) 

If  in  addition  both  r"  and  T^^  are  positive  semidefinite  (or  negative  semidefinite),  then 

1 


minllSrih  £ 


max(T2i,(TiiT22)'^) 


Sketch  of  Proof:  The  proof  of  Theorem  3  is  essentially  identical,  except  for  the  same  minor 
change  required  for  the  proof  of  Theorem  6. 

Theorem  8:  Let  T  and  7"'  be  as  in  (6.1),  and  t,j  =  \\T'^\\,  where  the  norm  satisfies  the  same 
conditions  as  in  Theorem  4.  Let 


6r 


be  conformally  partitioned  with  T.  Let  k  =  max(Tii,T22,T2i,(Ti2T2i)"').  Then  the  smallest 
||6r||  such  that  7"  +  BT  is  singular  is  bounded  by 

1 


BTu 

sr,2 

0 

0 

6r22 

0 

0 

0 

0 

min  llSri 


K 


where  7  £  3'  ■^.    More  specifically,  if  we  define  a  =  max(Tii,T22,T2i)  /  (T12T21)      then 


1 

J_ 
a 


if  a&l 

if  3""^  <  o  <  1 


li±^^        ifa.3- 


(5.1) 


l-a= 
If  T  and  &T  are  real,  this  result  may  be  improved  to 

—  <  min  lieril  <  —    . 
3k  "       "  K 

Again,  with  only  slight  modification  the  proof  of  Theorem  4  goes  through  for  these 
results  as  well. 

Theorem  9:  Let  T  and  r~'  be  real  symmetric  matrices  as  in  (6.1),  where  the  partitioning  is 
also  symmetric  so  7,y  =  tJ,  and  t,j  =  \\T'^\\2  =  7j,.  Let 


bT  = 


be  partitioned  conformally  with  T.  Let  k  =  max(Tii,T2i,(TiiT22)"^).   Then  the  smallest  ||87"||2 
such  that  r+  6r  is  singular  is  bounded  by 

j^  s  min  ||5r||2    . 

If  in  addition  either  k  =  tii  or  both  7""  and  T^^  are  positive  semidefinite  (negative  semidefin- 
ite), then 

min  ||8r||2  ^  —    . 


&Tu 

BJJ,    0 

67-2, 

0      0 

0 

0      0 
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Again,  the  proof  is  a  minor  modification  of  the  proof  of  Theorem  5. 
Finally,  we  consider  the  block  diagonal  perturbation 

67-1,      0         0 

0        hT22        0 


hT  = 


0 


0      hT 


33 


(6.2) 


where  hT  is  conformally  partitioned  to  T.  This  time,  our  lower  bound  technique  based  on 

Lemma  1  yields  a  lower  bound  that  is  not  always  attainable: 

Theorem    10:    Let    T,    7"'    and    57"    be    as    in    (6.1)    and    (6.2).     Let 

||diag(X,y)||  =  max(||X||,||y||).  Let 

K,    =    max  (t,,,T22,T33,(ti2T2i)"^,(ti3T3i)"^,(t23T32)"    ) 


T/;=r^| 


where 


and 


K2  =  max  ( 


^23^31       T13T32      T32T21       T12T23      T31T12      T21T13 


)    = 


max 

ij.k  distinct 


T21  T12  T31  Ti3  T32  T23 

Then  the  smallest  ||B7||  such  that  7  +  67  is  singular  is  bounded  by 

\ <  min  |I67||  ^  -^ 

9  max  (ki.kj)  ^\ 

where  -y  s  3"^.  Note  that  if  k2»ki,  the  lower  bound  will  be  much  smaller  than  the  upper 
bound. 

Proof:  The  upper  bound  comes  from  Theorem  6  and  considering  perturbations  to  any  two  of 
the  67„  alone.  The  lower  bound  comes  from  applying  Lemma  1  to  k  =  max(K,,K2):  To  first 
order  (7  +  87)"'  =  7~'- 7"'677~'.  From  7~'677~'  we  get  first  order  bounds  on  the  per- 
turbations of  all  the  terms  in  k,  and  show  that  to  first  order  the  change  in  k  is  bounded  by 

Bk  s  9k2  ||57||    . 

Using  Lemma  1,  we  conclude  min  ||67||  a  1/(9k).  d 

If  some  T,y,  ii'j  is  small  or  zero,  then  K2  may  be  very  large  or  infinite,  and  the  lower 


bound  will  be  much  too  small,  even  smaller  than  the  trivial  lower  bound  117 


-ii 


However, 


it  is  sometimes  close  to  the  upper  bound  (and  larger  than  ||7 
information,  as  in  the  following  3  by  3  example: 

7-1  = 


-iii-i 


),  and  so  supplies  useful 


1 

x2 

X 

1 

1 

1 

1 

X 

1. 

where  x»l.  In  this  example  kj  =  K2  =  j:<jc^~||7    '||. 

Just  as  in  Theorems  6  through  9,  Theorem  10  can  be  extended  to  the  case  of  perturbing 
three  diagonal  subblocks  of  a  block  4  by  4  matrix. 

Note  that  for  Theorems  6,  8  and  10,  the  lower  bound  for  the  2-norm  ||-||2  is  also  a  lower 
bound  for  the  Frobenius  norm  ||||f,  and  that  2"^  times  the  upper  bound  for  the  2-norm  is  an 
upper  bound  for  the  Frobenius  norm,  since  the  perturbation  attaining  the  upper  bound  is  at 
most  rank  2.  In  Theorems  7  and  9,  the  upper  and  lower  bounds  for  the  2-norm  are  also 
upper  and  lower  bounds  for  the  Frobenius  norm. 
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7.  Estimating  Norms  of  Sublocks  of  T    ' 

The  estimates  in  this  paper  depend  on  the  norms  of  sublocks  of  T"'.  It  therefore 
appears  that  computing  these  bounds  is  as  expensive  as  computing  the  inverse  of  T.  However, 
provided  it  is  inexpensive  to  solve  Tx  =  b  and  T^y  =  c  for  various  right  hand  sides  b  and  c 
(using,  for  example,  an  existing  LU  factorization  of  7"),  it  is  not  much  more  expensive  to  esti- 
mate the  norm  of  any  subblock.  Therefore,  the  bounds  in  this  paper  may  be  computed  in 
practice  even  for  large  sparse  matrices  whose  inverses  would  be  dense,  as  long  as  we  can 
solve  linear  systems  efficiently. 

Our  technique  depends  on  an  algorithm  of  Hager  [Hager,  Higham]  which  can  estimate 
the  1-norm  or  <»-norm  of  a  matrix  B  given  the  ability  to  compute  Bx  and  B^y  for  any  x  and  y. 
Suppose  without  loss  of  generality  that  the  submatrix  of  7""'  whose  1-norm  or  oo-norm  we 
want  to  estimate  lies  in  the  leading  r  rows  and  c  columns  of  r~'.  Let  E^  be  a  diagonal  matrix 
with  I's  in  diagonal  positions  1  through  r  and  zeros  elsewhere,  and  define  £<-  analogously. 
Then  what  we  want  to  estimate  is  ||£rr~'£<-||.  Using  Hager's  method,  we  only  need  to  be  able 
to  multiply  any  vector  by  either  E^T'^Ec  or  EcT'^E^;  both  multiplications  are  inexpensive 
given  an  LU  factorization  of  T  or  other  inexpensive  method  for  solving  Tx  =b  and  T^y  =  c. 

Hager's  algorithm  also  supplies  vectors  u  and  v'^  such  that  v^Bu  is  nearly  as  large  as 
possible.  These  vectors  can  be  used  to  construct  perturbations  nearly  satisfying  the  upper 
bounds  in  the  previous  sections,  or  as  starting  vectors  in  iterations  designed  to  compute  the 
smallest  perturbations  more  accurately. 


8.  Conclusions  and  Future  Work 

We  have  shown  how  to  estimate  the  norm  of  the  smallest  block  structured  additive  per- 
turbation of  a  block  2  by  2  matrix  that  makes  it  singular.  Our  estimates  are  accurate  to  within 
a  factor  of  at  most  3'  ^  (a  factor  of  3  for  real  matrices),  and  work  for  all  possible  block  per- 
turbations of  a  block  2  by  2  matrix  and  for  a  large  class  of  matrix  norms,  including  all  p- 
norms  and  the  Frobenius  norm.  Using  an  algorithm  of  Hager,  we  may  inexpensively  estimate 
our  bounds  even  for  large  matrices.  We  explicitly  exhibit  rank  one  or  rank  two  perturbations 
which  achieve  our  upper  bounds.  These  explicit  perturbations  may  be  used  as  starting  values 
for  an  optimization  routine  designed  to  compute  the  answer  to  higher  accuracy  than  our  a 
priori  estimates  provide.  These  results  extend  to  some  block  perturbations  of  3  by  3  block 
matrices,  dthough  the  upper  and  lower  bounds  may  not  always  be  close. 

Our  technique,  based  on  differential  inequalities  introduced  in  [Demmel87a],  can  be 
applied  to  other  block  perturbations  not  discussed  in  this  paper.  It  is  an  open  question  as  to 
whether  it  can  be  used  to  get  tight  upper  and  lower  bounds  for  more  complicated  structures 
than  those  addressed  here,  as  well  as  for  nonsquare  matrices;  this  is  future  work. 
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